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ABSTRACT 

The Pan-STARRSl survey is obtaining multi-epoch imaging in 5 bands 
(fl'pi'^pi^pi^piZ/pi) over the entire sky North of declination — 30deg. We de- 
scribe here the implementation of the Photometric Classification Server (PCS) 
for Pan-STARRSl. PCS will allow the automatic classification of objects into 
star /galaxy /quasar classes based on colors, the measurement of photometric 
redshifts for extragalactic objects, and constrain stellar parameters for stellar 
objects, working at the catalog level. We present tests of the system based 
on high signal-to-noise photometry derived from the Medium Deep Fields of 
Pan-STARRSl, using available spectroscopic surveys as training and/or verifica- 
tion sets. We show that the Pan-STARRSl photometry delivers classifications 
and photometric redshifts as good as the Sloan Digital Sky Survey (SDSS) pho- 
tometry to the same magnitude limits. In particular, our preliminary results, 
based on this relatively limited dataset down to the SDSS spectroscopic limits 
and therefore potentially improvable, show that stars are correctly classified as 
such in 85% of cases, galaxies in 97% and QSOs in 84%. False positives are less 
than 1% for galaxies, ~ 19% for stars and ~ 28% for QSOs. Moreover, photo- 
metric redshifts for 1000 luminous red galaxies up to redshift 0.5 are determined 
to 2.4% precision (defined as 1.48 x Median\zpfiQi — Zgpec 

\/{l + z)) with just 0.4% 
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catastrophic outliers and small (-0.5%) residual bias. For bluer galaxies up to 
the same redshift the residual bias (on average -0.5%) trend, percentage of catas- 
trophic failures (1.2%) and precision (4.2%) are higher, but still interestingly 
small for many science applications. Good photometric redshifts (to 5%) can be 
obtained for at most 60% of the QSOs of the sample. PCS will create a value 
added catalog with classifications and photometric redshifts for eventually many 
millions sources. 

Subject headings: Galaxies: active; Galaxies: distances and redshifts; Stars: 
general; Surveys:Pan-STARRSl 



1. INTRODUCTION 



Pan-STARRSl, the prototype of the Panoramic Survey Telescope and Rapid Response 
System, started scientific survey operations in May 2010 and is producing a 5 band ((^pirpiipi- 
z-piUvi) imagin g for 3/4 of the sky that will be ^ 1 mag deeper than the Sloan Digital Sky 
Survey (SDSS, lYork et al. II2OOOI : lAbazajian et al. II2009I ) at the end of the foreseen 3 years 
of operations. Approximately two hundred million galaxies, a similar number of stars, about 
a million quasars and ~ 7000 Type la Supernovae will be detected. In addition to the search 
for so-called "killer asteroids", the science cases driving Pan-STARRSl are both galactic and 
extragalactic. Extragalactic goals range from Baryonic Acoustic Oscillations and growth of 
structure, to weak shear, galaxy-galaxy lensing and lensing tomography. All rely on the 
determination of accurate photometric redshifts for extremely large numbers of galaxies. 
The galactic science cases focus on the search for very cool stars and the structure of the 
Milky Way, requiring good star/galaxy (photometric) classification and constraints on stellar 
parameters. Further science goals, such as the detection of high redshift quasars and galaxies, 
quasar/quasar and quasar/galaxy clustering, or the study of how galaxies evolve with cosmic 
time profit from the availability of good photometric reds hifts and star /galaxy photometric 
classification. Therefore in the last years we have designed ( Saglia IboOS : Snigula et al. 1 120091 ) 
and implemented the Photometric Classification Server (PCS) for Pan-STARRSl to derive 
and administrate photometric redshift estimates and probability distributions; star /galaxy 
classification and stellar parameters for extremely large datasets. The present paper describes 
the system and its performances in various tests. 

The structure of the paper is as follows. The Pan-STARRSl system is sketched out 
in Section [2l the observations we used are described in Section [3] and data processing is 
outlined in Section HI Section |5] presents the PCS system, its algorithms and components, 
and the implementation. Section |6] discusses the tests of the system, which is followed by 
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our conclusions in Section [3 



2. The Pan-STARRSl Telescope, Camera, and Image Processing 



The Pan-STARRSl system is a high-etendue wide- field imaging system, designed for 
dedicated survey operations. The system is installed on the peak of Haleakala on the island 
of Maui in the Hawaiian island chain. Routine observations are conducted remotely, from 
the Advanced Technology research Center in Pukalani. We provide below a terse summary 



STARRSl system, both hardware and software, is provid ed bv iKaiser et a. 



survey philosophy and execution strategy are described by [Chambers et al. 
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The Pan-STARRSl optical design ( iHodapp et al. II2004I ) uses a 1.8 meter diameter pri- 
mary mirror, and a 0.9 m secondary. The resulting converging beam then passes through two 
refractive correctors, a 48 cm x 48 cm interference filter, and a final refractive corrector that 
is the dewar window. The entire optical system delivers an //4.4 beam ar id an image with 



a dia meter of 3.3 degrees, with low distortion. The Pan-STARRSl imager (ITonry &: Onaka 



20091 ) comprises a total of 60 detectors, with 4800 x 4800 10 /im pixels that each subtend 
0.258 arcsec. The detectors are back-illuminated CCDs, manufactured by Lincoln Labora- 
tory, and read out using a Star Grasp CCD controller, with a readout t ime of 7 seconds for a 
full unbinned image. Initial performance assessments are presented by lOnaka et al. I (120081 ). 



The Pan-STARRSl observations are obtained through a set of five broadband filters, 
which we have designated as gpi, rpi, ipi, zpi, and ypi. Under certain circumstances 
Pan-STARRSl observations are obtained with a sixth, "wide" filter designated as wpi that 
essentially spans gpi, rpi, and ipi. Although the filter system for Pan-STARRSl has much 
in common with that used in previous surveys, such as the SDSS, there are important differ- 
ences. The gpi filter extends 20 nm redward of gsDss^ paying the price of 5577A sky emission 
for greater sensitivity and lower systematics for photometric redshifts, and the zpi filter is 
cut off at 930 nm, giving it a different response from zsdss- SDSS has no corresponding 
ypi filter, while Pan-STARRSl is lacking u-band ph otometry that S DSS provides. Further 



information on the passband shapes is described in iStubbs et al. I (120101 ). Provisional re- 
sponse functions (including 1.3 airmasses of atmosphere) are available at the project's web 
site0. Photometry is in the "natural" Pan-STARRSl system, m = —2.5log{flux) +m\ with 
a single zeropoint adju stment m' made in each band to conform to the AB magnitude scale 



(ITonry et al. I lin prepi ) . Pan-STARRSl magnitudes are interpreted as being at the top of 



http://svn.pan-starrs.ifa.hawaii.edu/trac/ipp/wiki/PSl_Photonietric_System 
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the atmosphere, with 1.3 airmasses of atmospheric attenuation being included in the system 
response function. No correction for Galactic extinction is applied to the Pan-STARRSl 
magnitudes. We stress that, like SDSS, Pan-STARRSl uses the AB photometric system 
and there is no arbitrariness in the definition. Flux representations are limited only by how 
accurately we know the system response function vs. wavelength. 

Images obtained b y the Pan-STA RRSl system are processed through the Image Pro- 
cessing Pipeline (IPP) (jMagnierll2006l ). on a computer cluster at the Maui High Performance 
Computer Center. The pipeline runs the images through a succession of stages, including 
flat-fielding ("de-trending"), a flux- conserving warping to a sky-based image plane, masking 
and artifact removal, and object detection and photometry. The IPP also performs image 
subtraction to allow for the prompt detection of variables and transient phenomena. Mask 
and variance arrays are carried forward at each stage of the IPP processin g. Photometric 
and astrometric measurern ents performed by the IPP system are described in lMagnieii (120071 ) 
and iMagnier et al. I ( l2008l ) respectively. 



The details of the photometric cali bration and the Pan-S TAR RSl zeropoint scale will be 
presented in a subsequent publication (ITonry et al. Ilin prepi ). and iMagnier et al. I (lin prepi ) 
will provide the application to a consistent photometric catalog over the 3/4 sky observed 
by Pan-STARRSl. 



3. Observations 



This paper uses images and photometry from the Pan-STARRSl Medium-Deep Field 
survey. In addition to covering the sky at 5 > —30 deg in 5 bands, the Pan-STARRSl survey 
has obtained deeper multi-epoch images in the Pan-STARRSl gpi, rpi, ipi, zpi and ypi 
bands of the fields listed in Table [TJ The typical Medium-deep cadence of observations is 
8 X 113 s in the gpi and rpi bands the first night, 8 x 240 s in the ipi band the second night, 
8 X 240 s in the zpi band the third night, 8 x 113 s in the ^^pi and rpi bands in the forth 
night, and on each of the 3 nights on either side of Full Moon 8 x 240 s in the ypi band. The 
5 a point source detection limits achieved in the various 5'pirpiipi2;pi?/pi bands, as well as 
other statistics of potential interest, are presented in Table |2] for the co-added stacks. They 
represent the depth of stacks at the time of writing, as observations are still on-going. In 
the following we will only consider the fields MDF03 to 10, that overlap with SDSS. 
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Table 1: Pan-STARRSl Medium-Deep Field Centers. 



Field 


Alternative names 


RA (degrees, J2000) 


Dec (degrees, J2000) 


MDFOl 


MDOl, PSl-MDOl 


35.875 


4.250 


MDF02 


MD02, PS1-MD02 


53.100 


-27.800 


MDF03 


MD03, PS1-MD03 


130.592 


44.317 


MDF04 


MD04, PS1-MD04 


150.000 


2.200 


MDF05 


MD05, PS1-MD05 


161.917 


58.083 


MDF06 


MD06, PS1-MD06 


185.000 


47.117 


MDF07 


MD07, PS1-MD07 


213.704 


53.083 


MDF08 


MD08, PS1-MD08 


242.787 


54.950 


MDF09 


MD09, PS1-MD09 


334.188 


0.283 


MDFIO 


MDIO, PSI-MDIO 


352.312 


-0.433 



4. Data processing 



The Pan-STARRSl IPP system performed fiatfielding on each of the individual images, 
using white light fiatfield images from a dome screen, in combination with an illumination 
correction obtained by rastering sources across the field of view. Bad pixel masks were 
applied, and carried forward for use at the stacking stage. After determining an initial as- 
trometric solution, the fiat-fielded images were then warped onto the tangent plane of the 
sky, using a flux conserving algorithm. The plate scale for the warped images is 0.200 arc- 
sec/pixel. The IPP software for stacking and photometry is still being optimized. Therefore, 
for this paper we generate stacks using customized software from one of us ( JT) and produce 
aperture photometry catalogs running Sextractor (iBertin fc Arnoux Ill996l ) on each stacked 
image independently. At a second stage we match the catalogs requiring a detection in each 
band within a one arcsec radius. For simplicity we use a rather large aperture radius (7.4 
arcsec) for our photometry and we do not apply any seeing correction, even if for some of 
the Medium-Deep fields slight variations in the FWHM between the filters are observed (see 
Table In the production mode of operations IPP will provide stacks with homogenized 
PSF, where forced photometry will be performed at each point where a detection in one of 
the unconvolved stacks is re ported. The ca talogs will be ingested in the Published Science 
Products Subsystem (PSPS. iHeasley II2008I ). the database that will serve the scientific com- 
munity with the final Pan-STARRSl products. Note that at present PCS uses fluxes and 
flux ratios, but no morphological information, such as spatial extent or shape. 
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Table 2: Pan-STARRSl MDF Statistics, Apr 2009-Apr 2011. 







1 V 


logr 


PSF 






r iciu. 


r lilt;! 


1 V 


iOg I 


r O r 






lyLLJc Ul 


fl'Pl 


AO 


A 7 

4. / 


1 


25 


l.OO 


OA 
Z4.0 


iviiyr uo 


5'pi 


"^8 
Oo 


A fi 
4.0 


1 0^ 
l.ZO 


l.OO 


OA A 
Z4.4 


WLLJC Ul. 




AO 


4 7 

4. / 


1 


15 


l.OO 


94 4 

Z4.4 


iviur uo 


'"PI 


oy 


4 R 
4.D 


118 
l.lo 


1 A^ 
1.40 


94 9 
Z4.Z 


A/rF>T7ni 


ipi 


41 
41 


4 Q 
4.y 


1 


05 


1 97 
l.Z ( 


94 4 
Z4.4 


iviur uo 


ipi 


41 
41 


4 Q 
4.y 


114 
1.14 


1 "^Q 

i.oy 


OA 

Z4.0 


A/T"npni 

LXiLJr Ul 


Zpi 


41 


4 Q 
4.y 


1 


03 


1 94 
1 .Z4 


9^? Q 
zo.y 


iVll^T UO 


Zpi 


'?8 
Oo 


4 Q 
4.y 


1 n^i 
1 .uo 


1 '^n 
1 .ou 


9^? 7 
ZO. ( 


i\/T'nT?ni 


l/Pl 


91 
Zl 


/I P. 

4.0 





95 


1 1 7 
1 . 1 ( 


99 A 
ZZ.4 


iviur uo 


ypi 


OA 
Z4 


A 7 
4. ( 


1 nn 

l.UU 


1 9p; 
1 . zo 


99 A 
ZZ.4 


i\/r"nT?n9 
iviur uz 


9vi 


ou 


4 1^ 
4.0 


1 


31 


1 7Q 

1 . ( y 


9/1 9 
Z4.Z 


iviur u ( 


9pi 


00 


4 ^ 
4.0 


1 9^? 
l.ZO 


1 fi8 
l.Oo 


9/1 


i\/r"nTrn9 
iviur uz 




9Q 

zy 


4 

4.0 


1 


20 


1 7/1 
1.(4 


9/1 1 
Z4. 1 


iviur u / 


fpl 


oy 


4 1^ 
4.0 


1 1 

1.10 


1 /1R 
1.40 


OA 
Z^.Z 


WLUr UZ 




ou 


4 S 

4.0 


1 


11 


1 ^n 

l.OU 


94 


iviur u / 


ipi 


oy 


4 Q 
4.y 


114 
1.14 


1 44 
1.44 


OA 


WLLJc yjZi 




oo 


4 8 

4.0 


1 


06 


l.OU 


O'i fi 
zo.o 


iviur u / 


Zpi 


A'i 

40 


4 Q 
4.y 


1 08 
1 .Uo 


1 V? 


O'i 7 
zo. 1 


iviiyr uz 


ypi 


lO 


4 ^ 

4.0 


1 


14 


1 49 

1 .4Z 


99 1 

ZZ. 1 


iviiyr u / 


ypi 


ou 


4 8 
4.0 


1 ni 

1 .Ul 


1 98 
l.ZO 


00 
zz.o 


iviur uo 


gpi 


Oo 


4 fi 

4.0 


1 


18 


1 44 

1 .44 


94 

Z4.0 


iviiyr Uo 


gpi 


"^8 

Oo 


4 ^ 
4.0 


1 97 

l.Z / 


1 fi8 

l.Oo 


OA 

Z4.0 


ivijjr uo 




( 


A P. 
4.0 


1 


09 


1 98 
1 .Zo 


94 4 
Z4.4 


iviur Uo 


rpi 


'?8 
Oo 


4 ^ 
4.0 


114 
1.14 


1 47 
1.4 ( 


OA 
Z^.Z 


iviur Uo 


ipi 


/1 1 
41 


A Q 

4.y 


1 


uo 


1 Q1 

l.ol 


9/1 /I 
Z4.4 


iviur Uo 


ipi 


00 


A 8 
4.0 


1 07 
l.U ( 


1 'i/l 
1.04 


OA 
Z^.Z 


MDF03 




42 


5.0 


1 


03 


1.27 


23.9 


MDF08 




40 


4.9 


1.09 


1.39 


23.7 


MDF03 


ypi 


20 


4.6 


1 


00 


1.36 


22.4 


MDF08 


ypi 


32 


4.9 


0.98 


1.27 


22.7 


MDF04 


gpi 


35 


4.6 


1 


17 


1.52 


24.5 


MDF09 


gpi 


34 


4.5 


1.26 


1.55 


24.3 


MDF04 




37 


4.6 


1 


09 


1.46 


24.3 


MDF09 


rpi 


33 


4.5 


1.15 


1.42 


24.1 


MDF04 


ipi 


35 


4.9 


1 


07 


1.35 


24.3 


MDF09 


ipi 


34 


4.8 


1.02 


1.36 


24.3 


MDF04 


Zpi 


28 


4.8 


1 


03 


1.32 


23.6 


MDF09 


Zpi 


34 


4.8 


1.02 


1.26 


23.7 


MDF04 


ypi 


8 


4.3 


1 


03 


1.21 


22.0 


MDF09 


ypi 


12 


4.3 


0.94 


1.12 


22.0 


MDF05 


9pi 


42 


4.6 


1 


24 


1.58 


24.4 


MDFIO 


gpi 


30 


4.5 


1.26 


1.60 


24.2 


MDF05 


rpi 


40 


4.6 


1 


17 


1.46 


24.3 


MDFIO 


rpi 


33 


4.5 


1.18 


1.53 


24.2 


MDF05 


ipi 


34 


4.8 


1 


06 


1.44 


24.3 


MDFIO 


ipi 


30 


4.8 


1.01 


1.31 


24.2 


MDF05 


Zpi 


27 


4.8 





99 


1.27 


23.6 


MDFIO 


Zpi 


28 


4.8 


1.03 


1.24 


23.6 


MDF05 


ypi 


17 


4.6 


1 


02 


1.33 


22.3 


MDFIO 


ypi 


11 


4.4 


0.96 


1.22 


22.2 



Note. — iV is the number of nights of observation, logt is the logj^Q of the net exposure time in sec, 
"PS'F" is the DoPhot FWHM of the core-skirt PSF in the stack-stacks, (w) is the median IPP FWHM of 
the observations, and mum is the 5cr detection limit for point sources. 
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5. The PCS system 

The science projects described in the Introduction define broadly the requirements 
for the PCS. It should provide software tools to compute: (a) photometric, color-based 
star/QSO/galaxy classification (i.e., morphological classifiers based on sizes and shapes are 
not part of PCS, even if this additional information could and probably will be added in 
the future), (b) best fitting spectral energy distribution and photometric redshifts (photo-z) 
with errors for (reddish) galaxies. Furthermore (c), (a subset of the stellar parameters) best- 
fitting temperature, metallicity, gravity and interstellar extinction with errors for (hot and 
cool) stars should be provided. The codes should be interfaced to the PSPS database and to 
the dataserver of IPP and results written directly into PSPS (i.e. photo-z with errors) and 
into additional databases linked to the PSPS, dubbed MYDB. 

In the following, we first describe the algorithms that implement (a) and (b) (Section 
15. ip . then the system components (i.e. the different independent pieces of code that make 
PCS, Section [52]), and finally its implementation (Section [5. 3p . Point (c) is still under devel- 
opment and will be described in a future paper. Presently, (a) and (b) do not communicate 
with each other and work independently. We plan to upgrade the package in the future, 
merging in an optimal way the classification information coming from both approaches, and 
using it to improve the determination of photometric redshifts. 

PCS is designed to work with catalogs providing fiuxes in the gpi, rpi, zpi, zpi, ypi 
bands and their errors (further bands can be added if available through external datasets). 
Optimal performances are expected for objects with good photometry in all Pan-STARRSl 
bands, but (somewhat deteriorated) output can be obtained even in the absence of some 
bands or low signal-to-noise data. 

5.1. The algorithms 

5.1.1. SVM for PanDzSC 

There is an ongoing trend in astronomy towards larger and deeper surveys. The natural 
consequence of this is the advent of very large datasets. There is therefore a need for 
automated data handling, or data mining, techniques to handle this data volume. Automated 
source classifiers based on photometric observations can provide class labels for catalogues, 
or be used to recover objects for further study according to various criteria. 



The SPSS used a selection of algorithms t o classify catalogue objects and select fol- 
lowup targets ( lAdelman-McCarthy et al. II2006I ). Methods based on colour selection were 
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particularly employed for finding probable quasars ( iRichards et al. II2002I). Mor e recently, 



Gao. Zhang fc Zhao I fl2008f ). IRichards et al. I fl2009ar ) and IRichards et al. I fl2009bl ) have used 



Kernel density estimators for quasar selection in SDSS data. Galaxy classification is usu- 
ally primarily based on detecting source extension by comparing PSF magnitudes with 
magnitudes based o n var ious profile models. For examples of Galaxy classification see 
Vasconcellos et al. I (201ll ) who used decision trees for star-galaxy separation in SDSS, or 



Henrion et al. 



(l201ll). who u s ed a B ayesi an method for star- g alaxy separation in SDSS and 



Tzalmantza et al. I (120071 ) and iTzalmantza et al. I ( l2009l ) have developed a test 



UKIDSS. 

library and selection methods for identifying galaxies in the forthcoming Gaia mission. 



Lee et al. I (|2008[ ) identified various stellar populations for foUowup in the SEGUE survey 



from SDSS photometry. Other attempts to identify stellar populations from photometric 
data include ISmith et al. I (|2010| ). who in vestigated the u s e of s everal automated classifiers 
on SDSS data to identify BHB stars, and lKlement et al. I (1201 if ) who used a support vector 
machine to separate fie l d giari ts from dwarfs using photometric data from a range of surveys. 
Marengo and Sanchez I (120091 ) used a kNN technique to search for brown dwarfs in Spitzer 
data. 

Finally, it should be mentioned that there is a whole industry of finding ways to classify 
various types of vari able objects based on their photometric light curves. See for example 



orlSchmidt et al. 



Dubath et al. I (l201lf) for classification of various stellar types with a Random forest method. 



( 120101 ) who developed a method for separating quasars from variable stars 



based on a structure function fit. 

The Pan-STARRSl Discrete Source Classifier (PanDiSC) used here is based on a support 
vector machine (SVM), a statistical learning algorithm. SVM works by learning a nonlinear 
boundary to optimally separate two or more classes of objects. Here it takes as input the 
4 Pan-STARRSl gpi-rpi, rpi-ipi, ipi-zpi, zpi-ypi colors. PanDiSC is based on the Discrete 
Source Clas sifier under development f or the purpose of classifying low resolution spectroscopy 



from Gaia (jBailer- Jones et al. 1120081 ). The SVM runs in th e PanDiSC compone nt of PCS 



(see Section [5^ . The SVM implementation used is libSvm ( IChang &: Lin 1120111 ). available 



at http://www.csie.ntu.edu.tw/~cjlin/libsvm Probabilities are calculated by modeling the 
density of da ta points on either side of the decision boundary, according to the method of 
Piatt I (119991). and the multiclass probabilities are obtained by pairwise coupling, as described 
by lWu. Lin fc Weng I (120041 ). PanDiSC chooses the highest membership probability from the 
eventual output for each source and assigns the membership to the star/QSO/galaxy class 
accordingly. 



The SVM is trained on a sample of objects with Pan-STARRSl photometry and spectro- 
scopic classification, to which the parameters 7 (the scaling factor) and C (the regular isat ion 
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cost) of the ra dial basis functions (RBF) kernel are tuned using a downhi ll simplex algorithra 
flSmithll2009l ). The system has been applied to the SDSS DR6 dataset Jsiting et al. Iboosi ) 
producing an excellent confusion matrix (i.e. high accuracy of classification, > 96%, and low 
percentage of false positives, ~ 29% for the stellar catalog, 0.5% for the galaxy catalog, 10% 
for the QSO catalog). The code is a compiled Java program. 



5.1.2. PhotoZ for PanZ 



In the last decade several efficient codes for the determination of photometric red- 
shifts have been developed, based either on empirical methods, or template fitting. In the 
first case one tries to parametrize the low- dimensional surface in color-redshift space that 
galaxies occupy using low-order polynomials, nearest-neighbor searches or neural networks 
(jCsabai et al. Il2003t ICoUister fc Lahav Il2004j ). These codes extract the information directly 
from the data, given an appropriate training set with spectroscopic information. Template 
fitting methods wor k instead with a set of mode l spectra frorn obse r ved galaxies and stellar 



Ilbert et al. 2009; Pello et al 



20091 ) 



population models (IPadmanabhan et al. I l2005l : lllbert et al. I l2006l : iMobasher et al. 1 120071 : 



The PhotoZ code used in the PanZ compo nent of the PCS sys tem (see Section 15. 2p 



belongs to this last category and is described in iBender et al. I (120011 ). The code estimates 



redshifts z by comparing T, a set of discrete template SEDs, to the broadband photometry 
of the (redshifted) galaxies. For each SED the full redshift posterior probability function 
including priors for redshift, absolute luminosity, and SED probability is computed using 
Bayes' theorem: 

P{z, T\F, M, ...) cx p{F\z, T)p{z, T\M), (1) 

where F is the vector of measured fiuxes in different bands, M the galaxy absolute magnitude 
in the B band (see below), p{z,T\M) the prior distribution and p{F\z,T) cx exp(— x^/2) is 
the probability of obtaining a normalized fo^' the given dataset, redshift and template T. 
In detail, we compute as: 



X 



E 



{Wiafi^modY + Mldat 



(2) 



where fi,mod{z) and fi^dat are the fluxes of the templates (at the redshift z) and of the data 
in a fllter band i, and Afi^^at are the errors on the data. The model weights Wi quantify 
the intrinsic uncertainties of the SEDs for the speciflc fllter i, presently they are all set to 
Wi = 0.1. The normalization parameter a is computed by minimizing at each choice of 
parameters. 
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The priors are (products of) parameterized functions of the type: 



p{y) (X y"'exp 



-ln{2) 



y-y 



(3) 



where the variable y stands for redshift or absolute magnitudes. Typically we use n = 0, 
p = 6 or 8, and y and ay with appropriate values for mean redshifts and ranges, or mean 
absolute B magnitudes and ranges, which depend on the SED type. The absolute magnitudes 
of the objects are computed on the fly for the considered rest-frame SED, normalization a 
and redshift, using the standard cosmological parameters {Qm = 0.3, A = 0.7, Hq = 70 
Mpc/(km/s)). The use of fluxes fi^dat instead of the magnitudes allows us to take into 
account negative datapoints and upper limits. 

The set of galaxy templates is semi-empirical and can be optin aized through an in- 



teractive comparison with a spectroscopic dataset. The original set (IBender et al. 1 12001 



Gabasch et al. 112004 ) includes 31 SEDs describing a broad range of galaxy spectral types. 



from early to late to star-burstin g objects. Recently, we add ed a set of SEDs tailor ed to 
fit Luminous Red Galaxies (LRGs. lEisenstein et al. Il200ll ). see iGreisel et al. I (jin prepi ). and 
one SED to represent an a verage QSO spectru m. This was obtained by averaging the low 
redshift HST composite of iTelfer et al. I (120021 ) and the SDSS median quasar composite of 



Vander Berk et al. 



( I2OOII ). Furthermore, the method also fits a set of stellar templates, 
allowing a star/galaxy classification and an estimate of the line-of-sight extinction for stellar 
objects. The templates cover typically the wavelength range A = 900 A up to 25000 A (with 
the QSO template covering instead 300-8000 A) and are sampled with a step typically 10 A 
wide (varying from 5 to 20 A; the QSO SED has AA = 1 A). 

The method has bee n extensively tested an d applied to several ph o tometric catalogs with 



spectroscopic follow-up (jGabasch et al. I l2004l : iFeulner et al. I l2005l : iGabasch et al. 1 12008 



BrimiouUe et al. I lin prepI ). Given a (deep) photometric dataset covering the wavelength 



range from the U to the K band, excellent photometric redshifts with {zphot — Zspec) /(I + ^) ~ 
0.03 up to 2 ~ 5 with at most a few percent catastrophic failures can be derived for every 
SED type. With the help of appropriate priors, photometric redshifts accurate to 2% (in 
{zphot — 2spec)/(l + Zspe(^) with just 1% outllcrs (sc c Sect. [6]for definition s) are obtained for 
LRGs using the ugriz SDSS photometric dataset (IGreisel et al. I lin prepI ). This is the first 
time we attempt to determine the photometric redshifts of QSOs. We couple the available 
SED to a strong prior in luminosity that dampens its probability as soon as the predicted B 
band absolute magnitude is fainter than -24. 

The code is in C-I-+. A Fortran versio n is available as implemented under Astro- WISE 



(IValentiin et al. Il200a ISadia et al. 1120111 ). 
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5.2. The components of PCS 

Fig. [1] describes schematically the components and data flow of PCS. Each component, 
or module, is a separate unit of the package with a well deflned operational goal. Following 
the usual convention, it is indicated by a box with bars (for interfaces and/or processes, 
from A to F, plus PanZ and PanDiSC) or a cylinder (for a database) in Fig. [TJ There are 
four databases: two reside in Hawaii (PSPS, where the primary Pan-STARRSl catalogs are 
stored and a subset of the output produced by PCS is copied, and MYDB, where the whole 
of the PCS output goes). The other two are in Garching: the Master, where conflguration 
flies and templates are stored, and the local PSl DATA, where PCS input and output are 
stored. Light-blue boxes indicate interfaces to the users and the yellow box to the upper left 
the IPP system. The arrows in the flgure represent links between the components. Their 
colors code the type of link (red for input, blue for output/results, grey for conflguration 
data, cyan for a trigger). For clarity the lines joining to the C, D' and PanDiSC components 
are dotted. The paper-like symbols indicate generated data-flies. The two yellow boxes 
"Photometric catalog" and "Photometric Classiflcation Data" refer to the manual mode of 
operations, see below. 

In the normal batch mode of activities, the interfaces/processes A to F permanently run 
in the background and react to changes in the PSPS database (A) or the Master database 
in stand-alone mode. However, parallel manual sessions can be activated, where the user is 
free to use parts or all of the pipeline, adding further photometric or spectroscopic datasets 
to the local database, changing setups, testing new SEDs and recipes or deflning and using 
new training sets. Here below we flrst describe the automatic mode of operations and then 
summarize the manual options. 

The starting point is the PSPS database in Hawaii, which is fllled with catalog data 
produced by IPP. The module A periodically checks when new sets of data with all 5 band 
fluxes measured are available in PSPS. It copies tables of input data according to selectable 
input parameters to Garching. The module B detects the output of process A, ingests these 
data in the loca l database and compu tes for each object the galactic absorption corrections 



according to the lSchlegel et al. I (Il998[ ) maps. They are applied to the photometry only when 
computing photometric redshifts. The bare photometry is considered when classifying the 
objects (in PanDiSC) or when testing the stellar templates (in PanZ). Once B has flnished, 
the module C and C start to prepare PanZ and PanDiSC jobs, respectively, to analyze the 
newly available data, according to the preset conflguration flies. The input catalog is split 
into many suitable chunks to allow the triggered submission of multiple jobs on the parallel 
queue of the computer cluster (see Sect. 15. 3p . Once all the jobs are flnished, the modules 
D and D' become active. They copy the results of PanZ and PanDiSC, respectively, into 
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the local database, excepting the full redshift probability distributions for each object, that 
are too large to be written in the database directly. They are saved as separate compressed 
files. At this point the procedure E is activated. This prepares the data tables for delivery 
to Hawaii. Finally the module F signals PSPS in Hawaii, which then uploads them into the 
PSC MYDB part of the PSPS database. They can be accessed by the whole Pan-STARRSl 
community through the Pan-STARRSl Science Interface (the light blue box at the lower left 
of Fig. p. 

As a second mode of operation, the system can work in 'manual' mode, where specific 
data sets can be (re-) analyzed independently of the automatic flow. This is useful in the 
testing phase, or while considering external datasets, such as the SDSS, or mock catalogs 
of galaxies (indicated by the yellow box "Photometric catalog" of FigH]). The output of 
such manual runs is indicated by the yellow box "Photometric Classification Data" of FigJT] 
Many of the tests discussed in this paper have been obtained in 'manual' mode. Extensive 
spectroscopic datasets are available in the local database to allow an efficient cross-correlation 
with the Pan-STARRSl objects. 

The configuration files, as well the different modes of operations, can be manipulated 
through a user- friendly web interface (the light blue box at the lower right of Fig. [1]). This 
feature, initially developed to ease the life of the current small number of PCS members, 
makes the system interesting for a possible future public release. 

5.3. The implementation of PCS 

The PCS is implemented on the PanSTARRS cluster, a 175 node (each with 2.6GHz, 
4 CPUs and 6 GB memory, for a total of 700 CPUs) Beowulf machine with 180 TB disk 
space, attached to a PB robotic storing device, mounted at the Max-Planck Rechenzentrum 
in Garching. The modules described in the previous section are a series of shell scripts, or 
html/php files, executing php code or SQL commands, or running compiled C-I--I- code. 
In particular, the input/output interfaces A and F to the PSPS database make use of 
SOAP/http calls. The Schlegel maps are queried using the routines available from the 
web. The local database is based on MySQL. A set of Python scripts allows the user to 
automatically generate plots and statistics similar to Fig. H] and [51 Presently, new available 
photometry is downloaded from the PSPS database in chunks of 4 million objects. They are 
split in blocks of 25000, each of which is sent as a single job to the parallel queue. These 
numbers are subject to further optimization. The system allows parallel running of jobs 
operating on the same dataset, but with different recipes. 




Fig. 1. — The components of the PCS, see Sect. 15.21 for a description. 
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The performance (in terms of processed objects per second) of the complete system is 
summarized in the Tabled where the case of a catalog of 1.8 million objects is presented. 

The modular structure of the PCS allows implementation of further components, such 
as alternative photometric redshift codes and the modules to constrain stellar parameters 
that are under development. 

Table 3: Performance test for the PCS based on a catalog with 1.8 million objects. 



Module 


Task 


T^iiration 


Performance 


A 


Read from PSPS 


4 min 


7500/sec 


B 


File conversion 


2:10 min 


14000/sec 


B 


Input data injection 


5:20 min 


5600/sec 


C 


Extraction (18 data files) 


2:10 min 


14000/sec 


C 


Job creation (18 jobs) 


0:30 min 




PanZ 


Running jobs 


32 min 


950/sec 


D 


Results injection 


16 min 


1875/sec 


D 


PhotoZ-P compression + storage 


75 min 


400/sec 


C 


Extraction (18 data files) 


2:00 min 


15000/sec 


C 


Job creation (18 jobs) 


0:30 min 




PanDiSC 


Running jobs 


l:00min 


30000/sec 


D' 


Results injection 


15 min 


2000/sec 


E 


Results extraction 


1:40 min 


18000/sec 


F 


Signal to MyDB 






F 


Signal to PSPS 







6. PCS tests 



Several spectroscopic surveys overlap with the Pan-STARRSl MDF s, providing abun 



dant sp ectral classifications a nd redshifts: BOSS (jAihara et al. 2011, MDFl, 4, 9, 10) 



CDFS fVanzella et al. 



Lee et al. 



2001 



2008 



2006 



M DF2) SDSS and SEGUE/SDSS flEisenstein et al 



Z Cosmos ( Lilly et al. 



Abazaiian et al. II2009I MDF2 to 10), 2dF Galaxy Redsh ift Survey fIColless et al 



2001 



MD F4), VVDS ( Le Fevre et al. I I2004I. l200,4 ICariUi et al. 1 120081 MDFl, 7 and 10) 



20071 . MDF4). Here we present tests of PanDiSC and PanZ based 



on the DR7 SDSS dataset, that provides the largest available homogeneous spectroscopic 
follow-up of LRGs (selected as DR7 objects with primTarget=32 or 96), the objects for which 
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the Pan-STARRSl filter set suffices to deliver excellent photometric redshifts. Moreover, the 
SDSS dataset comprises a large enough sample of stars and QSOs to allow a global sensible 
test of the capabilities of PCS. The remaining spectroscopic surveys will be discussed in 
future Pan-STARRSl papers dealing with the science applications of the PCS. 

Table H] gives the average galactic reddening of the fields covered by SDSS (the largest 
value of E{B — V") = 0.066 mag is reached in MDF09) and summarizes the numbers of 
matched objects, typically several hundreds per field, with more than a thousand in MDF4 
and a total of 5784. The same table splits them into stars, galaxies and QSOs. The majority 
of the matches are galaxies (of which approximately a quarter are red), but numerous (of the 
order of a hundred) stars and quasars are represented per field. Fig. |2] shows the histograms 
of the magnitudes in the Pan-STARRSl filters (within an aperture of 7.4 arcsec radius) for 
the SDSS spectroscopically classified (and assumed to be the "truth") stars, galaxies and 
QSOs of the sample. As expected due to the SDSS spectroscopic limits (resulting from the 
main spectroscopic sample limited at r=17.77 and the sparser additional surveys, reaching 
fainter magnitudes), the galaxy sample peaks around gpi^ 17.5, while the QSO sample is 
^ 1.5 mag fainter. The star sample spans a broader range of magnitudes, from gpi^ 16 to 
^ 24. Given the achieved depth of the MDFs, photometric data for this sample has very large 
signal-to-noise and can be used to test the systematic limitations of PCS. In the following 
we describe tests performed using both the Pan-STARRSl and the SDSS photometry, to 
show that the Pan-STARRSl dataset is at least as good as SDSS. 

Table 4: The SDSS spectroscopic dataset used to test PCS, with number of objects split by 
star/galaxy/QSO category (LRGs are a subset of galaxies). 



MDF 


{E{B - V)) 
(mag) 


NsDSS 


Stars 


Galaxies 


LRGs 


QSOs 


3 


0.027 


704 


49 


577 


107 


78 


4 


0.026 


1125 


128 


880 


169 


117 


5 


0.008 


913 


41 


785 


155 


87 


6 


0.014 


732 


27 


628 


153 


77 


7 


0.011 


953 


104 


755 


173 


94 


8 


0.010 


226 


2 


207 


49 


17 


9 


0.066 


589 


54 


495 


95 


40 


10 


0.038 


542 


51 


436 


99 


55 


Total 




5784 


456 


4763 


1000 


565 
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Fig. 2. — The Pan-STARRSl magnitude (inside an aperture of 7.4 arcsec radius) histograms 
matched to the spectroscopic SDSS and classified through the spectroscopic SDSS informa- 
tion as stars (green), galaxies (red) and QSOs (blue). 
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6.1. Star /Galaxy /Quasar classification 

We trained and tested the PanDiSC SVM using a 10-fold cross-validation approach. We 
divided the sample described in Table H] with Pan-STARRSl photometry in 10 partitions, 
each with 45 stars, 475 galaxies and 55 QSOs. Note that they span the magnitude ranges 
shown in Fig. [2J We constructed 10 subsets by randomly selecting 55 out of the 475 galaxies. 
We created 10 training sets, by concatenating 9 of the subsets, and tested the PanDiSC SVM 
on each of the 10 partitions not used in the 10 training sets. 

Table [5] shows the results: galaxies are classified correctly in almost 97% of cases, with 
very small variations from field to field. Stars (one of which having equal probability of being 
a star, a galaxy or a QSO, and therefore not considered) and QSOs are recovered on average 
in ^ 84 % of cases. Successful star classification goes up to 95% for MDF5, and is as low 
as 75% for MDFIO. Successful QSO classification is at the 87% level in MDF4 and drops to 
69% for MDF9. We will investigate the statistically significance and possible cause of these 
variations with future larger Pan-STARRSl catalogs. 

The relatively poor performance of the stellar classification is driven by early-type stars 
alone (correctly classified in 80% of cases) and the Pan-STARRSl filter set. The classification 
of late-type stars is better, with just 4% of the late stars being wrongly classified as galaxies. 

We now look at the fraction of false positives. Only 21 stars and 44 QSOs are classified 
as galaxies, therefore the galaxy sample defined by PanDiSC is contaminated at the level of 
1%. This is not unexpected, because the extragalactic MDF number counts are dominated 
by galaxies. The purity of the star and QSOs samples are much worse. There are 38 
galaxies and 47 QSOs classified as stars, which results in a 19% contamination of the star 
catalog defined by PanDiSC. Without the contribution of galaxies, which could be flagged by 
adding morphological information (i.e. whether the objects are point-like or extended), the 



contamination (by QSOs) reduces to 10%. Preliminary tests (IKlement 1120091 ) show that this 
development is indeed very promising. There are 47 stars and 107 galaxies classified as QSOs. 
This means that the QSO catalog defined by PanDiSC is contaminated at the 28% level, 
without galaxies just 8.5%. Clearly, the situation will be different at lower galactic latitudes, 
where stars dominate the number count s at these magnitude limits. Overall, the results are 



not much worse than those reported by lElting et al. I (|2008[ ). They can be improved when 



larger Pan-STARRSl catalogs will be available, by optimizing the probability thresholds 
for making a classification decision, since they depend on the relative distribution of stars, 
galaxies, quasars in the training/test data sets used to make the assessment. 

Fig. |3] presents the Pan-STARRSl color-color plots with the distributions of stars, 
galaxies and quasars of the spectroscopic SDSS dataset. Objects that are stars according 
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to the spectroscopic SDSS classification are shown on the left diagrams, objects that are 
galaxies in the central diagrams and objects that are QSOs on the right diagrams. Objects 
classified correctly by PanDiSC are shown black. Objects wrongly classified as stars by 
PanDiSC are shown green, wrongly classified as galaxies red, wrongly classified as QSOs in 
blue. So a red dot on the left column is a star that PanDiSC classified as a galaxy. 

Clearly, misclassifications happen in regions of color space where the different types 
overlap, where only an additional filter (for example the u band) would help discriminate 
between the populations. In contrast, late- type stars are seldom misclassified thanks to their 
red colors that divide them well from galaxies and QSOs. Note that the increased scatter 
in the stellar (/pi-rpicolors at rpi-zpi> 1.8 is due to stars near or below the gpi and rpi 
magnitude limits of Table [2J 

We repeated the same tests based on the 10-fold cross-validation technique using the 
SDSS Petrosian ugriz photometry: the results are presented in Table |6l While the presence 
of the u band boosts the success rate for QSOs (up to 94%) and stars (up to 92%, with early 
type stars correctly classified in 90% of the cases) the absence of the y band and the lower 
quality of the z band penalizes marginally the galaxy classification (down to 95%) and the 
stellar classification of late star types (classified correctly in 95% of cases). The star false 
positives are up to 28% (of the total true stars), due to the higher number of misclassified 
galaxies, but would drop to just the 6% of misclassified QSOs if information about size were 
added. The QSO false positives, slightly better at the 27%, drop to just 3% without galaxies. 
The galaxy false positives stay at the 1% level. 

Table 5: SVM predictions for evaluation set using Pan-STARRSl photometry: the confusion 
matrix (first in absolute numbers, and second in fractions normalized to 1) with true classes 
in rows. 



True classes 


Ntot 


Star 


Galaxy 


Quasar 


Star 


449 


381 


21 


47 






0.849 


0.047 


0.104 


Galaxy 


4750 


38 


4605 


107 






0.008 


0.970 


0.022 


Quasar 


550 


47 


44 


459 






0.085 


0.080 


0.835 



- 19 - 



Stars 



Galaxies 



^ 1 

a, 
S-i 







OT 

N 



0. 



CL 

I 

m 
N 



- 3 ^ 



M I I I I I I I I I I I I I I I I I I I 



• •if 



I I I I I I I I I I I I I I I I I I I I I I 



I I I I I I I I I I I I I I I I I I I I I I 



QSOs 



I I I I I I I I I I I I I I I I I I I I I 14-1 I I I I I I I I I I I I I I I I I I I I I I 



I I I I I I I I I I I I I I I I I I I I I I 



1 _2 3 
Spsi ^psi 



12 3 
Spsi ^psi 



1 - 



^ 



-1 ~- 



12 3 
Spsi ^psi 



lll|llll|llll|llll|IIIL|JIII|llll|llll|llll|llll_|JIII|llll|llll|llll|IIIL| 



0123 0123 0123 



'"psi ^PSl 



^PSl ^PSl 



^PSl ^PSl 




-10 1 -10 1 -10 1 



^PSl '^PSl 



^PSl •^psi 



■^PSl "^PSl 



Fig. 3. — The distribution of SDSS stars (left), galaxies (middle) and QSOs (right) in the 
color-color plots of Pan-STARRSl. Objects classified correctly by PanDiSC are shown black, 
objects classified wrongly as stars are green, wrongly as galaxies are red, wrongly as QSO 
are blue. 
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Table 6: SVM predictions for evaluation set using the SDSS Petrosian magnitude ugriz pho- 
tometry: the confusion matrix (first in absolute numbers, and second in fractions normalized 
to 1) is given as fractions normalized to 1 with true classes in rows. 



True classes 


Ntot 


Star 


Galaxy 


Quasar 


Star 


450 


412 


23 


15 






0.916 


0.051 


0.033 


Galaxy 


4750 


99 


4525 


126 






0.021 


0.953 


0.026 


Quasar 


550 


25 


15 


510 






0.046 


0.027 


0.927 



6.2. Photometric redshifts 

The Pan-STARRSl photometric dataset misses the u band on the blue side of the 
spectral energy distribution, and the NIR colors redder than the y band. Therefore one 
expects any photometric redshift program to perform best for red galaxies at moderate 
redshifts (e.g., the LRGs), and to fail especially when blue galaxies at low redshifts are 
considered. Similarly, late stars are expected to be better recognized than earlier ones. 
Finally, since at present the PhotoZ program misses SEDs optimized for QSOs, we expect 
poor performances in these cases. 

Fig. m shows the comparison between spectroscopic and photometric redshifts obtained 
for the SDSS sample of LRGs, field by field. Fig. |5]shows the results for the whole sample. As 
usual, we define the percentage of catastrophic failures r] as the fraction of objects (outliers) 
for which \zphot — Zspec\ > 0.15 x {l + Zgpec), the residual bias as the mean of {zphot — Zspec)/{^ + 
Zspec) without the outliers, and the robust error as = 1.48 x Median\zphot — z sped/ + z spec) 
without the outliers. For LRGs, the Pan-STARRSl photometry allows the determination of 
photometric redshifts accurate to 2.4% in cr^, with bias smaller than 0.5%, no strong trend 
with redshift and 0.4% catastrophic outliers (when no QSO SED is allowed). No field-to-field 
dependencies are present. 

As expected, the situation is not as satisfactory for non-LRGs. Fig. |6] shows that 
especially at Zspec < 0.2 the residuals are biased in a systematic way and more than 1% 
catastrophic outliers are present. Nevertheless, the robust estimate of the scatter remains 
below 5%. 

If we now use for the same galaxies the Petrosian ugriz SDSS photometry, we find the 
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following. The photometric redshifts for LRGs are similarly good (2.6%), but with a higher 
percentage of outliers. In contrast, precision (3.7%) and percentage of outliers (1%) are 
better for blue galaxies, where the presence of the u band helps. 



As described in Sect. 15. H PanZ computes also the goodness of fits for a number of stellar 
templates. Therefore the difference xitar ~ X^aiaxy between the of the best-fitting stellar 
SED xltar the best fitting galaxy SED x^aiaxy provides a crude galaxy/star classification: 
if xltar ~ X^gaiaxy < 0' stcllar template is providing a better fit than the galaxy ones and 
we classify the object as a star. In Fig. [71 left (where just for plotting convenience we 

give xltarlxlaiaxy) ^6 show that requiring xltar - X^aiaxy < (i.e. xlar/xlaiaxy < 1 the 
plot) allows to correctly classify spectroscopically confirmed SDSS stars in 73% of the cases. 
The percentage of successful classifications grows to 89% if only late type SDSS stars are 
considered. The percentage of success is 98% if spectroscopically confirmed SDSS galaxies 
are considered (Fig. [TJ right). 

Finally, we consider the class of QSOs. Fig. IH left, shows that spectroscopically con- 
firmed SDSS QSOs are classified as QSOs (i.e. are best fit by the QSO SED) in 22% and as 
galaxies in 50% of the cases. As expected, the photometric redshifts are very poor (Fig. [HI 
right). The QSO SED in the sample is selected as giving the best-fit in 27% of the cases, 
giving the right redshift in 20% of the cases. For an additional 28% where catastrophically 
wrong redshifts are derived, the QSO SED gives the second best fit and a reasonable redshift. 
Still, if we allow only the QSO SED to be used, we get a good redshift (~ 5% in a^) for only 
61% of the objects. We are in the process of adding some more QSO SEDs to model better 
the redshift dependence of QSO evolution. First tests show that only modest improvements 
can be achieved, since we are hitting the intrinsic limitations of the Pan-STARRSl filter 
photometry, combined with the well known difficulties of deriving photomet r ic redshifts for 



the p ower-law like, feature-weak shape of QSO SEDs (IBudavari et al. Il200ll : ISalvato et al. 



201ll ). The addition of the u band certainly improves the results a lot. When we derive 
photometric redshifts using the SDSS ugriz Petrosian magnitudes, we get a best-fit with the 
QSO SED in 51% of the cases (with a photometric redshift good to 5% in 43% of the cases), 
and for an additional 19% the QSO SED gives the second best solution with the correct 
redshift. If we allow only the QSO SED to be used, we get a good redshift (~ 5% in a^) for 
70% of the objects. 

Table [7] shows the confusion matrix for PanZ as a Star/QSO/Galaxy photometric clas- 
sifier. PanZ performs as well as PanDiSC when classifying galaxies, but is poorer when 
it comes to stars and QSOs, probably due to a lack of appropriate SED templates. As a 
consequence, the false positive contamination is higher for stars (53%) and galaxies (8%) 
classes, but lower (4%) for QSOs, compared to PanDiSC. Finally, it is interesting to note 
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that PanZ biases the classification in a different way than PanDiSC: there are 29 stars and 
3 QSOs recognized as such by PanZ but not by PanDiSC. 

Finally, Table [S] shows the confusion matrix for PanZ as a Star/QSO/Galaxy photo- 
metric classifier when the SDSS Petrosian ugriz photometry is used. The percentage of 
correctly classified QSOs doubles (but is still not competitive with the results of PanDiSC) 
to reach 44%, the star classification is slightly improved to 80% and the success in the galaxy 
classification is slightly worse (85%). Therefore, the presence of the u band helps in the clas- 
sification of (blue) stars and quasars, but does not compensate the absence of the y and of 
good z band data for galaxies. 

As discussed in Sect. 16. ![ the final assessment of the relative performances of PanDiSC 
and PanZ as morphological classifiers will be made when larger Pan-STARRSl catalogs will 
allow the derivation of optimal probability thresholds. 

Table 7: PanZ as a Star/QSO/Galaxy photometric classifier: the confusion matrix in frac- 
tions normalized to 1 with true classes in rows. 

True classes Star Galaxy Quasar 

Star 0.730 0.241 0.029 

Galaxy 0.017 0.981 0.002 

QSO 0.285 0.497 0.218 



Table 8: PanZ as a Star/QSO/Galaxy photometric classifier using the SDSS Petrosian ugriz 
photometry: the confusion matrix in fractions normalized to 1 with true classes in rows. 



True classes 


Star 


Galaxy 


Quasar 


Star 


0.797 


0.166 


0.036 


Galaxy 


0.131 


0.849 


0.020 


QSO 


0.037 


0.522 


0.441 



7. Conclusions 



We presented the Photometric Classification Server of Pan-STARRSl, a database- 
supported, fully automatised package to classify Pan-STARRSl objects into stars, galaxies 
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Fig. 4. — The comparison between spectroscopic and photometric redshifts for the SDSS 
sample of LRGs. The difference {zphot — Zspec) /(I + Zspec) is shown as a function of Zgpec- The 
magenta points show the three objects for which the QSO SEDs would give the best fit (but 
a catastrophically poor photometric redshift). 
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Fig. 5. — The comparison between spectroscopic and photometric redshifts for the SDSS 
sample of LRGs. Left: the histogram of Zphot — Zspec/iX + ^spec)- Right: z^pec vs. Zphot- The 
magenta points show the three objects for which the QSO SEDs would give the best fit (but 
a catastrophically poor photometric redshift). 
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Fig. 6. — The comparison between spectroscopic and photometric redshifts for the SDSS 
galaxy sample not classified as LRG. The magenta points show the five of the six objects 
for which the QSO SEDs would give the best fit (but a catastrophically poor photometric 
redshift, two of these cases are visible in red). 
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Fig. 7. — The efficiency of the PanZ star recognition. Left: for 73% of spectroscopically 
conffimed SDSS stars PanZ finds a stellar SED as the best fit to the Pan-STARRSl pho- 
tometry (i.e. xitm- < X^aiaxy' notc that for plotting convenience ratios are shown). Of the 
remaining 123 stars, 13 are best-fit by the QSO SED. Right: PanZ finds X^aiaxy < xttar ^'^^ 
98% of spectroscopically confirmed SDSS galaxies. For 8 of these the QSO SED fits best. 
For one galaxy the best extragalactic fit is obtained by the QSO SED and is poorer than the 
one obtained using stellar templates (i.e. x^aiaxy > X%ar)- 
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Fig. 8. — PanZ performances for QSO. Left: for 29% of spectroscopically confirmed SDSS 
QSOs PanZ finds a stellar SED as the best to the Pan-STARRSl photometry (i.e. X^aiaxy > 
xitar^ note that for plotting convenience ratios are shown). Right: PanZ redshifts for 
QSOs. The red dots show the cases where the QSO SED gives the best-fit. The yellow dots 
show the second-best fit, given by the QSO SED, in cases of catastrophic failures (purple 
points), where a good redshift is obtained. 
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and quasars based on their Pan- STARRS 1 colors and compute the photometric redshifts of 
extragalactic objects. Using the high signal-to- noise photometric catalogs derived for the 
Pan-STARRSl Medium-Deep Fields we provide preliminary Star/QSO/Galaxy classifica- 
tions and demonstrate that excellent photometric redshifts can be derived for the sample 
of Luminous Red Galaxies. Further tuning of our probabilistic classifier with the large 
Pan-STARRSl catalogs available in the future will optimize its already nice performances 
in terms of completeness and purity. Applied to the photometry that the Stt survey is going 
to deliver, possibly combined with u-band or near-infrared photometry coming from other 
surveys, this will allow us to build up an unprecedented large sample of LRGs with accurate 
distances. In a future development of PCS we will include size and/or morphological infor- 
mation to improve further the object classification, implement the PanSTeP (Pan-STARRSl 
Stellar Parametrizer) software to constrain stellar parameters, and enlarge the SED sample 
to follow LRGs to higher redshifts and possibly improve results for blue galaxies and QSOs. 
Alternative photometric redshift codes could also be considered. Moreover, the indepen- 
dent classification information coming from PanZ and PanDiSG will be merged and used 
to iterate on the photometric redshifts, by narrowing down the choice of SEDs, or deciding 
which photometry (psf photometry for point-objects versus extended sources photometry for 
galaxies) is more appropriate for each object. 

Facilities: PSl (GPCl) 

The Pan-STARRSl Survey has been made possible through contributions of the Insti- 
tute for Astronomy, the University of Hawaii, the Pan-STARRS Project Office, the Max- 
Planck Society and its participating institutes, the Max Planck Institute for Astronomy, 
Heidelberg and the Max Planck Institute for Extraterrestrial Physics, Garching, The Johns 
Hopkins University, Durham University, the University of Edinburgh, Queen's University 
Belfast, the Harvard-Smithsonian Center for Astrophysics, and the Las Cumbres Observa- 
tory Global Telescope Network, Incorporated, the National Central University of Taiwan, and 
the National Aeronautics and Space Administration under Grant No. NNX08AR22G issued 
through the Planetary Science Division of the NASA Science Mission Directorate. Partial 
support for this work was provided by National Science Foundation grant AST- 1009749. 
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